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Abstract 

Quadratic Programming (QP) is the well-studied problem of maximizing over {—1, 1} values 
the quadratic form X^j^j o-ijXiXj. QP captures many known combinatorial optimization problems, 
and assuming the unique games conjecture, semidefinite programming techniques give optimal 
approximation algorithms. We extend this body of work by initiating the study of Quadratic 
Programming problems where the variables take values in the domain {—1,0,1}. The specific 
problems we study are 

QP-Katio : max — ^ , and Normalized QP-Ratio : max — ^ ^ . 

{-1,0,1}" l^xt {-1,0,1}" l^a.xf 

where di = ja^ | 
j 

These are natural relatives of several well studied problems (in fact Trevisan introduced the 
latter problem as a stepping stone towards a spectral algorithm for Max Cut Gain). These 
quadratic ratio problems are good testbeds for both algorithms and complexity because the tech- 
niques used for quadratic problems for the {—1,1} and {0, 1} domains do not seem to carry over 
to the {—1,0,1} domain. We give approximation algorithms and evidence for the hardness of 
approximating these problems. 

We consider an SDP relaxation obtained by adding constraints to the natural eigenvalue (or 
SDP) relaxation for this problem. Using this, we obtain an 0(n^/'^) algorithm for QP-ratio. We 
also obtain an 0{n^^'^) approximation for bipartite graphs, and better algorithms for special cases. 

As with other problems with ratio objectives (e.g. uniform sparsest cut), it seems difficult 
to obtain inapproximability results based on P ^ NP. We give two results that indicate that 
QP-Ratio is hard to approximate to within any constant factor: one is based on the assumption 
that random instances of Max fc-AND are hard to approximate, and the other makes a connection 
to a ratio version of Unique Games. 

There is an embarrassingly large gap between our upper bounds and lower bounds. In fact, we 
give a natural distribution on instances of QP-Ratio for which an approximation (for small s) 
seems out of reach of current techniques. On the one hand, this distribution presents a concrete 
barrier for algorithmic progress. On the other hand, it is a challenging question to develop lower 
bound machinery to establish a hardness result of n'^ for this problem. 
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1 Introduction 



Semidefinite programming techniques have proved very useful for quadratic optimization problems 
(i.e. problems with a quadratic objective) over {0, 1} variables or {±1} variables. Such problems 
admit natural SDP relaxations and beginning with the seminal work of Goemans and Williamson 
[GW95], sophisticated techniques have been developed for exploiting these SDP relaxations to obtain 
approximation algorithms. For a large class of constraint satisfaction problems, a sequence of exciting 
results [KKMO07, KO06, KV05] culminating in the work of Raghavendra[Rag08], shows that in fact, 
such SDP based algorithms are optimal (assuming the Unique Games Conjecture). 

In this paper, we initiate a study of quadratic programming problems with variables in {0, ±1}. 
In contrast to their well studied counterparts with variable values in {0, 1} or {±1}, to the best of 
our knowledge, such problems have not been studied before. These problems admit natural SDP 
relaxations similar to problems with variable values in {0, 1} or {±1}, yet we know very little about 
how (well) these problems can be approximated. We focus on some basic problems in this class: 

QP-Ratio : max — — 7^ , and Normalized QP-Ratio : max — ^ 7^ — . (1) 

{-1,0,1}" 22 {-1,0,1}" 22diXi 

where di = \aij\ 
j 

Note that the numerator is the well studied quadratic programming objective 'Ylii<j'^i,j^i^j- 
Ignoring the value of the denominator for a moment, the numerator can be maximized by setting 
all variables to be ±1. However, the denominator term in the objective makes it worthwhile to set 
variables to 0. An alternate phrasing of the ratio-quadratic programming problems is the following: 
the goal is to select a subset of non-zero variables S and assign them values in {±1} so as to maximize 
the ratio of the quadratic programming objective 'Ylii<j&s '^ij^i^j to the (normalized) size of S. 

This problem is a variant of well studied problems: Eliminating as a possible value for variables 
gives rise to the problem of maximizing the numerator over {±1} variables - a well studied problem 
with an O(logn) approximation [NRT99, MegOl, CW04]. On the other hand, eliminating —1 as a 
possible value for variables (when the aij are non-negative) results in a polynomial time solvable 
problem. Another closely related problem to QP-Ratio is a budgeted variant where the goal is to 
maximize the numerator (for the QP-Ratio objective) subject to the denominator being at most 
k. This is harder than QP-Ratio in the sense that an a-approximation for the budgeted version 
translates to an a-approximation for QP-Ratio (but not vice versa). The budgeted version is a 
generalization of fc-Densest Subgraph, a well known problem for which there is a huge gap between 
current upper[BCC^10] and lower bounds[Kho04, Fei02]. In this paper, we chose to focus on the 
"easier" class of ratio problems. 

Though it is a natural variant of well studied problems, QP-Ratio seems to fall outside the realm 
of our current understanding on both the algorithmic and inapproximability fronts. One of the goals 
of our work is to enhance (and understand the limitations of) the SDP toolkit for approximation 
algorithms by applying it to this natural problem. On the hardness side, the issues that come up are 
akin to those arising in other problems with a ratio/expansion flavor, where conventional techniques 
in inapproximability have been ineffective. 

The Normalized QP-Ratio objective arose in recent work of Trevisan[Tre09] on computing Max 
Cut Gain using eigenvalue techniques. The idea here is to use the eigenvector to come up with a 
'good' partial assignment, and recurse. Crucial to this procedure is a quantity called the GainRatio 
defined for a graph; this is a special case of Normalized QP-Ratio where aij = — 1 for edges, and 
otherwise. 
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1.1 Our results 

We first study mathematical programming relaxations for QP-Ratio. The main difficulty in obtaining 
such relaxations is imposing the constraint that the variables take values {—1,0, 1}. Capturing this 
using convex constraints is the main challenge in obtaining good algorithms for the problem. 

We consider a semidefinite programming (SDP) relaxation obtained by adding constraints to 
the natural eigenvalue relaxation, and round it to obtain an 0{n^^^) approximation algorithm. An 
interesting special case is bipartite instances of QP-Ratio, where the support of aij is the adjacency 
matrix of a bipartite graph (akin to bipartite instances of quadratic programming, also known as the 
Grothendieck problem). For bipartite instances, we obtain an 0(n^/^) approximation and an almost 
matching SDP integrality gap of Q(n^^'^). 

Our original motivation to study quadratic ratio problems was the GainRatio problem studied 
in Trevisan [Tre09]. We give a sharp contrast between the strengths of different relaxations for 
the problem and disprove Trevisan's conjecture that the eigenvalue approach towards Max Cutgain 
matches the bound achieved by an SDP-based approach [CW04]. See Section 3 for details. 

Complementing our algorithmic result for QP-Ratio, we show hardness results for the problem. 
We first show that there is no PTAS for the problem assuming P ^ NP. We also provide evidence 
that it is hard to approximate to within any constant factor. We remark that current techniques 
seem insufficient to prove such a result based on standard assumptions (such as -P 7^ NP) ~ a similar 
situation exists for other problems with a ratio objective such as sparsest cut. 

In Section 4.2 we rule out constant factor approximation algorithms for QP-Ratio assuming that 
random instances of A;- AND are hard to distinguish from 'well-satisfiable' instances. This hypothesis 
was used as a basis to prove optimal hardness for the so called 2-Catalog problem (see [Fei02]) 
and has proven fruitful in ruling out 0(l)-approximations for the densest subgraph problem (see 
[AAM"^!!]). It is known that even very strong SDP relaxations (in particular, Q{n) rounds of the 
Lasserre hierarchy) cannot refute this conjecture [Tul09]. 

We also show a reduction from Ratio UG (a ratio version of the well studied unique games 
problem), to QP-Ratio. We think that ratio version of Unique Games is an interesting problem 
worthy of study that could shed light on the complexity of other ratio optimization questions. The 
technical challenge in our reduction is to develop the required fourier-analytic machinery to tackle 
PCP-based reductions to ratio problems. 

There is a big gap in the approximation guarantee of our algorithm and our inapproximability 
results. We suspect that the problem is in fact hard to approximate to an factor for some e > 0. In 
Section 4.1, we decribe a natural distribution over instances which we believe are hard to approximate 
up to polynomial factors. Our reduction from /c-AND in fact generates hard instances of a similar 
structure albeit ruling out only constant factor approximations. 



2 Algorithms for QP-Ratio 

We start with the most natural relaxation for QP-Ratio (1) : 

max — K subject to Xi £ [—1, 1 

(instead of {0, ±1}). The solution to this is precisely the largest eigenvector of A (scaled such that 
entries are in [—1, 1]). However it is easy to construct instances for which this relaxation is bad: if A 
were the adjacency matrix of a (n -|- 1) vertex star (with vq as the center of the star), the relaxation 
can cheat by setting xq = ^ and Xi = for i S [n] to give a gap of O(y^) (the integer optimum is 
at most 1). 
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We show that SDP relaxations give more power in expressing the constraints Xi G {0, ±1}? 
Consider the following relaxation: 

max Ajj ■ (w,j,Wj) subject to ^^wf = 1, and 

|(w„w,)| < w2 for alH,j (2) 

It is easy to see that this is indeed a relaxation: start with an integer solution {xj} with k non-zero 
Xi, and set Vj = (xi/^/k) ■ vq for a fixed unit vector vq. 

Without constraint (2), the SDP relaxation is equivalent to the eigenvalue relaxation given 
above. Roughly speaking, equation (2) tries to impose the constraint that non-zero vectors are of 
equal length. In the example of the (n + l)-vertex star, this relaxation has value equal to the true 
optimum. In fact, for any instance with Aij > for all this relaxation is exact [ChaOO]. 

There are other natural relaxations one can write by viewing the {0, ±1} requirement like a 3- 
alphabet CSP. We consider one of these in section 2.5, and show an r2(n^/^) integrality gap. It is 
interesting to see if lift and project methods starting with this relaxation can be useful. 

In the remainder of the section, we describe a simple 0(n^^^) rounding algorithm, which shows 
that the additional constraints (2) indeed help. We first describe an integrality gap of roughly n^^'^, 
as it highlights the issues that arise in rounding the SDP solution. 

2.1 Integrality gap instance 

Consider a complete bipartite graph on L,R, with \L\ = n^/^, and \R\ = n. The edge weights are set 
to ±1 uniformly at random. Denote by B the n^/^ x n matrix of edge weights (rows indexed by L 
and columns hy R). A standard Chernoff bound argument shows 

Lemma 2.1. With high probability over the choice of B, we have opt < ^/\ogn ■ n^^^. 

Proof. Let Si S2 ^ R be of sizes a, b respectively. Consider a solution in which these are 

the only variables assigned non-zero values (thus we fix some ±1 values to these variables). Let val 
denote the value of the numerator. By the Chernoff bound, we have 

P[val > c^/^] < e^"'/^ 

for any c > 0. Now choosing c = lOy^ (a + b) log n, and taking union bound over all choices for Si, S2 
and the assignment (there are i"^) {1)'^"'^'' choices overall), we get that w.p. at least 1 — no 
assignment with this choice of a and b gives val bigger than ab{a + b) logn. The ratio in this case 

is at most ^logn • < \/log n ■ v}!^. Now we can take union bound over all possible o and b, thus 
proving that opt < ri^l^ w.p. at least 1 — ■ 

Let us now exhibit an SDP solution with value n^/^. Let vi, V2, . . . , be mutually orthogonal 
vectors, with each v? = l/2n^/^. We assign these vectors to vertices in L. Now to the jth vertex in 
R, assign the vector Wj defined by = Bij-^. 

It is easy to check that = -7^ = Further, note that for any i,j, we have (since all Vj 

2 

are orthogonal) Bij{vi,Wj) = Bj- • ^ = This gives Bij{vi,Wj) = r?l'^ ■ (l/2n) = nV2/2. 

From these calculations, we have Vi, j, |vj • Wj| < w| (thus satisfying (2); other inequalities of this 
type are trivially satisfied). Further we saw that Y.i ^1 + Ej w| = 1. This gives a feasible solution 
of value fi(?i^/^). Hence the SDP has an J7(n^/^) integrality gap. 
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Connection to the star example. This gap instance can be seen as a collection of n^/^ stars 
(vertices in L are the 'centers'). In each 'co-ordinate' (corresponding to the orthogonal Vj), the 
assigment looks like a star. 0{y/n) different co-ordinates allow us to satisfy the constraints (2). 

This gap instance is bipartite. In such instances it turns out that there is a better rounding 
algorithm with a ratio <D{'n}/^) (Section 2.3). Thus to bridge the gap between the algorithm and the 
integrality gap we need to better understand non-bipartite instances. 

2.2 An 0(n^/^) rounding algorithm 

Consider an instance of QP-Ratio defined by ^(„xn)- Let Wj be an optimal solution to the SDP, and 
let the objective value be denoted sdp. We will sometimes be sloppy w.r.t. logarithmic factors in the 
analysis. 

Since the problem is the same up to scaling the Aij, let us assume that maxjj- \Aij\ = 1. There 
is a trivial solution which attains a value 1/2 (if i,j are indices with \Aij\ = 1, set Xi,Xj to be ±1 
appropriately, and the rest of the x's to 0). Now, since we are aiming for an 0{n^/^) approximation, 
we can assume that sdp > n^/^. 

As alluded to earlier (and as can be seen in the gap example), the difficulty is when most of the 
contribution to sdp is from non-zero vectors with very different lengths. The idea of the algorithm 
will be to move to a situation in which this does not happen. First, we show that if the vectors 
indeed have roughly equal length, we can round well. Roughly speaking, the algorithm uses the 
lengths ||vj|| to determine whether to pick z, and then uses the ideas of [CW04] (or the earlier works 
of [NRT99, MegOl]) apphed to the vectors 

Lemma 2.2. Given a vector solution {vj}, with G [t/A,t] for some r > and A > 1, we can 
round it to obtain an integer solution with cost at least sdp/(-v/Alogn). 

Proof. Starting with Vj, we produce vectors Wj each of which is either or a unit vector, such that 

r Eij^iji^i^^j) _ , Ei,jAij{wi,Wj) sdp 

Stated this way, we are free to re-scale the Vj, thus we may assume r = 1. Now note that once we have 
such Wj, we can throw away the zero vectors and apply the rounding algorithm of [CW04] (with a loss 
of an O(logn) approximation factor), to obtain a 0,zbl solution with value at least sdp/(-v/A log n). 
So it suffices to show how to obtain the Wj. Let us set (recall we assumed r = 1) 

_ J Vi/||vj||, with prob. ||vi|| 
I otherwise 

(this is done independently for each i). Note that the probability of picking i is proportional to the 
length of Vj (as opposed to the typically used square lengths, [CMM06] say). S ince — 0, we tLave 

The above proof only shows the existence of vectors Wj which satisfy the bound on the ratio. The 
proof can be made constructive using the method of conditional expectations, where we set variables 
one by one, i.e. we first decide whether to make wi to be a unit vector along it or the vector, 
depending on which maintains the ratio to be > ^ = Now, after fixing wi, we fix W2 similarly 
etc., while always maintaining the invariant that the ratio > 9. 
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At step i, let us assume that wi, . . . ,Wj_i have aheady been set to either unit vectors or zero 
vectors. Consider Vj and let Vj = Vj/||vj||. Wj = Vj w.p. pi = ||vj|| and w.p (1 — Pi). 

In the numerator, B = ^^y^j^i k^iO-jkiwj^Wk)] is contribution from terms not involving i. Also 
let Cj = J2k=/=i^ik'^k and let c'- = ^j^idji'Wj. Then, from equation 3 

^ EE^.;^.aj-fc(wj-,Wfe)] _ p,((v„Ci) + (c^, Vi) +B) + {I - Pi)B 



^[E,|w,P] P.(l + E,vJ|w,P)+(l-p.)(E,yJ|w,P)) 

Hence, by the simple fact that if c, d are positive and > 0, then either | > or ^ > 0, we see 
that either by setting Wj = Vj or Wj = 0, we get value at least 9. ■ 

Let us define the 'value' of a set of vectors {wj} to be val := S^^^^'^'^j) _ T^}xq Vj we start will 
have val = sdp. 

Claim 2.3. We can move to a set of vectors such that (a) val is at least sdp/2, (b) each non-zero 
vector Vj satisfies > 1/n, (c) vectors satisfy (2), and (d) Ei'^? — 2- 

The proof is by showing that very small vectors can either be enlarged or thrown away . 

Proof. Suppose < v,^ < 1/n for some i. If Si = AijVi • Vj < 0, we can set Vj = and improve 
the solution. Now if Si > 0, replace Vj by • (this only increases the value of ^ Ajj(vj, Vj)), 

and repeat this operation as long as there are vectors with v? < 1/n. Overall, we would only have 
increased the value of ^ AijVi ■ Vj, and we still have Yli ^i — 2- Further, it is easy to check that 
|(vj,Vj)| < V? also holds in the new solution (though it might not hold in some intermediate step 
above) . ■ 

The next lemma also gives an upper bound on the lengths - this is where the constraints (2) are 
crucial. 



Lemma 2.4. Suppose we have a solution of value Bn^ and Ei'^i — 2- '^'^^ move to a solution 

,2 



/^i ^1 

with value at least Bn^^ /2, and v? < 16/n^ for all i. 
Proof. Let v? > 16/n^ for some index i. Since |(vj, Vj)| < v|, we have that for each such i, 

^Aij(vi,Vj) < S^v^ < 25 

3 j 

Thus the contribution of such i to the sum j Aij (vj , Vj) can be bounded by m x 45, where m is the 
number of indices i with v? > 16/n''. Since the sum of squares is < 2, we must have m < n^/8, and 
thus the contribution above is at most Bn^ /2. Thus the rest of the vectors have a contribution at least 
sdp/2 (and they have sum of squared-lengths < 2 since we picked only a subset of the vectors) ■ 

Theorem 2.5. Suppose A is an n x n matrix with zero's on the diagonal. Then there exists a 
polynomial time <D{n^/^) approximation algorithm for the QP-Ratio problem defined by A. 

Proof. As before, let us rescale and assume maxi, j\Aij\ = 1. Now if p > 1/3, Lemmas 2.3 and 2.4 
allow us to restrict to vectors satisfying 1/n < v? < 4/n^, and using Lemma 2.2 gives the desired 
0(n^/^) approximation; if p < 1/3, then the trivial solution of 1/2 is an 0{n^^^) approximation. ■ 
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2.3 The bipartite case 

In this section, we prove the following theorem: 

Theorem 2.6. When A is bipartite (i.e. the adjacency matrix of a weighted bipartite graph), there 
is a (tight upto logarithmic factor) 0{n^^^) approximation algorithm for QP-Ratio . 

Bipartite instances of QP-Ratio can be seen as the ratio analog of the Grothendieck problem 
[AN06]. The algorithm works by rounding the semidefinite program relaxation from section 2. As 
before, let us assume maxjj aij = 1 and consider a solution to the SDP (2). To simplify the notation, 
let Ui and Vj denote the vectors on the two sides of the bipartition. Suppose the solution satisfies: 

(1) a,,{u,,v,)>n^, {2)Y,n'i = Y,^^,=^- 

If the second condition does not hold, we scale up the vectors on the smaller side, losing at most 
a factor 2. Further, we can assume from Lemma 2.3 that the squared lengths uf,Vj are between ^ 
and 1. Let us divide the vectors {ui} and {vj} into logn groups based on their squared length. There 
must exist two levels (for the u and u's respectively) whose contribution to the objective is at least 
n"/ log^ n.^ Let L denote the set of indices corresponding to these Ui, and R denote the same for vj. 
Thus we have Yli&L j&R'^iji'^ij''^j) — ?^°/log^n. We may assume, by symmetry that \L\ < \R\. Now 
since Vj < 1, we have that u| < l/\R\ for all j € R. Also, let us denote by Aj the |L|-dimensional 
vector consisting of the values a^j, i £ L. Thus 



We will construct an assignment Xi S {+1,-1} for i S L such that • J2j^R. \ Z^ieL^u^*! 
'large'. This suffices, because we can set yj G {+1, —1}, j & R appropriately to obtain the value above 
for the objective (this is where it is crucial that the instance is bipartite - there is no contribution 
due to other yj 's while setting one of them) . 

Lemma 2.7. There exists an assignment o/{+l,— 1} to the Xi such that 

j&R ieL jeR 
Furthermore, such an assignment can be found in polynomial time. 

Proof. The intuition is the following: suppose Xi,i E L are i.i.d. { + 1,-1} random variables. For 
each j, we would expect (by random walk style argument) that E [| X^ieL ^ij-^il] ~ ll^ilb, and thus 
by linearity of expectation, E 
follows. This can in fact be formalized: 



^jeR I SieL ^ ^jeRW^jh- Thus the existence of such Xj 



B[\YaijX,\]>\\Aj\\2/12 (5) 

This equation is seen to be true from the following lemma 

^Such a clean division into levels can only be done in the bipartite case - in general there could be negative 
contribution from 'within' the level. 
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Lemma 2.8. Let 61, . . . , 6„ G M with ^ ■ 6? = 1, and let Xi, . . . , X„ be i.i.d. {+1, —1} r.u.s. Then 

n\Y,biX^\] > 1/12. 

i 

Proof. Define the r.v. Z := J^- biXi. Because the X^ are i.i.d. {+1, -1}, we have EfZ^] = = 1. 

Further, E[Z^] = + 6 Ei<j ^^^^ < ^(Ei ^?)^ = 3- Thus by Paley-Zygmund inequality, 

^ -4^-16 E[Z4] - 16' 

Thus \Z\ > 1/2 with probability at least 3/16 > 1/6, and hence E[|Z|] > 1/12. ■ 

We can also make the above constructive. Let r.v. S := Eje/? I E/eL ^ non-negative 

random variable, and for every choice of Xi, we have 

jeR ieL j&R 

Let p denote P[5 < —^]- Then from the above inequality, we have that {1 — p) > ^^^172 • Thus if we 
sample the Xi say n times (independently), we hit an assignment with a large value of S with high 
probability. ■ 

Proof of Theorem 2.6. By Lemma 2.7 and Eq (4), there exists an assignment to Xj, and a correspond- 
ing assignment of {-|-1, — 1} to yj such that the value of the solution is at least 

Now if \L\ < 71^/2, we are done because we obtain an approximation ratio of 0{n^^'^log'^ n). On the 
other hand if > n^/^ then we must have \\ui\\2 < l/n^/^. Since we started with n? and vf being 
at least l/2n (Lemma 2.3) we have that all the squared lengths are within a factor 0(n^/2) of each 
other. Thus by Lemma 2.2 we obtain an approximation ratio of 0(n^/'^ log n). This completes the 
proof. ■ 



2.4 Algorithms for special cases 

2.4.1 Poly-logarithmic approximations for positive semidefinite matrices 

The MaxQP problem has a better approximation guarantee (of 2/7r) when A is psd. Even for the 
QP-Ratio problem, we can do better in this case than for general A. In fact, it is easy to obtain a 
polylog(n) approximation. 

This proceeds as follows: start with a solution to the eigenvalue relaxation (call the value p). 
Since A is psd, the numerator can be seen as Ej(-^«^)^) where Bi are linear forms. Now divide the 
Xi into O(logn) levels depending on their absolute value (need to show that Xi are not too small - 
poly in 1/n, l/\A\oo). We can now see each term BiXi a sum of O(logn) terms (grouping by level). 
Call these terms Cj,..., Cf, where I is the number of levels. The numerator is upper bounded by 
•^(Ei EjC^/)^)' thus there is some j such that 'Yliii^D'^ least l/log^ n times the numerator. 
Now work with a solution y which sets yi = Xi if Xj is in the j'th level and otherwise. This is a 
solution to the ratio question with value at least p/l^. Further, each \yi\ is either or in [p,2p\, for 
some p. 
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From this we can move to a solution with \yi\ either or 2p as fohows: focus on the numerator, and 
consider some Xj 7^ with \xi\ < 2p (strictly). Fixing the other variables, the numerator is a convex 
function of Xi in the interval [— 2p, 2p] (it is a quadratic function, with non-negative coefficient to the 
xf term, since A is psd). Thus there is a choice of Xi = ±2^ which only increases the numerator. 
Perform this operation until there are no 7^ with < 2p. This process increases each \xi\ by a 
factor at most 2. Thus the new solution has a ratio at least half that of the original one. Combining 
these two steps, we obtain an 0(log^ n) approximation algorithm. 



2.4.2 Better approximations v^hen the optimum is large. 

We can also obtain a much better approximation algorithm for QP-Ratio when the maximum value 
of the instance is large, say edmax, where dmax = maxj |ajj|. For QP-Ratio instances A with 
OPT{A) > ed 

max 7 we can find a solution of value e ^^^^^^ dmax using techniques from section 3. 
This is because when all the degrees di are roughly equal (say ^dmax < di < dmax for some 
constant 7 > 0), then it is easy to check that an 0{a) approximation to Normalized QP-Ratio (de- 
fined in section 3) is an 0(0/7) approximation to the same instance of QP-Ratio. Further, when 
OPr(QP-Ratio) > ed 

max 7 we can throw away vertices i of degree di < ^dmax without losing in the 
objective. Hence, for a QP-Ratio instance A when OPT{A) > edmax, we can find a solution to 
QP-Ratio of value e-'^'^'^^'^Umax- 



2.5 Other Relaxations for QP-Ratio 

For problems in which variables can take more than two values (e.g. CSPs with alphabet size r > 2), 
it is common to use a relaxation where for every vertex u (assume an underlying graph), we have 

variables Xu \ ■■,Xu \ and constraints such as {xu\xu^) = and J2i{^u\xu^) = 1 (intended solution 
being one with precisely one of these variables being 1 and the rest 0). 

We can use such a relaxation for our problem as well: for every Xj, we have three vectors aj,6j, 
and Cj, which are supposed to be 1 if = 0, 1, and —1 respectively (and otherwise). In these terms, 
the objective becomes 

'^Aij{bi,bj) - {bi,Cj) - {ci,bj) + {ci,Cj) = '^Aij{bi - Ci,bj - cj). 
The following constraints can be added 

T.'^ + cf = l (6) 

i 

{ai,bj),{bi,Cj),{ai,Cj) > for ah i,j (7) 

{ai,aj),{bi,bj),{ci,Cj) > for ah i,j (8) 

{ai,bi) = {bi, a) = {ai, a) = (9) 

aj + bf + cj = l for all i (10) 

Let us now see why this relaxation does not perform better than the one in (2). Suppose we start with 
a vector solution Wj to the earlier program. Suppose these are vectors in W^. We consider vectors 
in M"'"'"'^"'"^, which we define using standard direct sum notation (to be understood as concatenating 
co-ordinates). Here Cj is a vector in M" with 1 in the ith position and elsewhere. Let 0^ denote the 
vector in W. 
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We set (the last term is just a one-dim vector) 

bi = 0„ e wi/2e (|w»l/2) 

Q = 0„ e -w»/2© (|w,|/2) 
Oi = - w2 • Ci © Od © (0) 

It is easy to check that {ai,bj) = {ai,Cj) = 0, and {bi,Cj) = 1/4- (— (wj,Wj) + |wj||wj|) > for all i,j 
(and for i = j, {bi,Ci) = 0). Also, bf+cf = = 1-af. Further, {bi,bj) = V4-((wi, Wj) + |wi||wj|) > 
0. Last but not least, it can be seen that the objective value is 

"^Aijibi - Ci,bj - Cj) = y^^Aij{wi,Wj), 

as desired. Note that we never even used the inequalities (2), so it is only as strong as the eigenvalue 
relaxation (and weaker than the sdp relaxation we consider). 

Additional valid constraints of the form Oj + 6^ + Cj = vq (where vq is a designated fixed vector) can 
be introduced - however it it can be easily seen that these do not add any power to the relaxation. 



3 Normalized QP-Ratio 

Given any symmetric matrix A, the normalized QP-Ratio problem aims to find the best {—1,0, 1} 
assignment which maximizes the following: 

max T u ■? (11) 

xe{-i,o,i}" E»^i Klixi + xj) 

= — ; — Ty where di = |aj, | are "the degrees" 

Note that when the degrees di are all equal {di = d Vi), this is the same as QP-Ratio upto 
a scaling. Though the two objectives have a very similar flavor, the normalized objective tends to 
penalize picking vertices of high degree in the solution. 

This problem was recently considered by Trevisan [Tre09] in the special case when A = —W{G) 
where W{G) are the matrix of edge weights (0 if there is no edge) and called this quantity the 
GainRatio of G. He gave an algorithm for Max Cut-Gain which uses GainRatio as a subroutine, 
based purely on an eigenvalue relaxation (as opposed to the SDP-based algorithm of [CW04]). His 
algorithm for GainRatio can also be adapted to give an algorithm for Normalized QP-Ratio with a 
similar guarantee. We sketch it below. 



3.1 Algorithm based on [Tre09] 

Consider the natural relaxation 

x^Ax 
xeT-T,i]" J2i dix'i 

This is also the maximum eigenvalue of D~^/'^ AD^/"^ where D is the diagonal matrix of degrees. 
Trevisan [Tre09] gave a randomized rounding technique which uses just threshold cuts to give the 
following guarantee. 



max ^ ^ — 2 (12) 
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Lemma 3.1. [Tre09j In the notation stated above, for every 7 > 0, there exists ci,C2 > with 
C1C2 < 76"^/'^, such that given any a; G M" s.t. x^Ax > ex^Dx, outputs a distribution over discrete 
vectors { — 1,0, 1}" (using threshold cuts) with the properties: 

1. \ci^YiYj - XiXj\ < -i{xj + x"]) 

2. ^\Yi\< 02x1 

Proposition 3.2. Given a Normalized QP-Ratio instance A with value at least e finds a solution 
y G {-1,0,1}" of value e'^^'^/'^h 

Proof. For the eigenvalue relaxation equation 12, there is a feasible solution x such that x^Ax > 
ex^Dx where Dii = \aij\ and Dij = for i 7^ j. Now applying Lemma 3.1, we have 

BlaijYiVj] > —{aijXiXj - 710^^1(3;^ + xj)) 
Bl^QijYiYj] > — {x^Ax - 2'yx^Dx) 

> ^'-^^\ x^Dx) 

Cl 

Also, E[^. < C2X^Dx. Hence, there exists some vector y G {—1, 0, 1}*^ of value {e — 2^)/ciC2, 

which shows what we need for sufficiently small 7 < e/2. As in previous section 2, this can also 
be derandomized using the method of conditional expectations (in fact, since the distribution is just 
over threshold cuts, it suffices to run over all n threshold cuts to find the vector y). ■ 



3.2 Eigenvalue relaxation for Max Cut-Gain 

As mentioned earlier Trevisan [Tre09] shows that if the eigenvalue is e, the GainRatio is at least 
g-0(i/e). He also conjectures that there could a better dependence: that the GainRatio is at least 
e/log(l/e), whenever eigenval = e. This would give an eigenvalue based algorithm which matches the 
SDP-based algorithm of [CW04]. We show that this conjecture is false, and describe an instance for 
which eigenval is e, but the GainRatio is at most exp(— 1/e^/^). This shows that the eigenvalue based 
approach is necessarily 'exponentially' weaker than an SDP-based one. Roughly speaking, SDPs are 
stronger because they can enforce vectors to be all of equal length, while this cannot be done in an 
eigenvalue relaxation. First, let us recall the eigenvalue relaxation for Max CutGain 

J2{u,v}&E(G) —WuvXuXv 

Eig = max -^^^ j --^5 ^. 

^..6[-l,l] 2^{u,v}(^E{G) \Wuv\[xi + xi) 

Description of the instance. Li what follows, let us fix e to be a small constant, and write 
M = 1/e (thought of as an integer), and m = 2/e. 

The vertex set is F = 14 U V2 U • • • U Vm, where |Vi| = M*. We place a clique with edge weight 1 
on each set V^. Between Vi and Fj+i, we place a complete bipartite graph with edge-weight (1/2 + 
We will call Vi the ith. level. Thus the total weight of the edges in the ith level is roughly M'^'^ /2, and 
the weight of edges between levels i and {i + 1) is (1/2 + e)M^*+^. 

Lemma 3.3. There exist Xi £ [-1, 1] such that j^^"'">^^<°> """rTf = ^(e^). 
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Proof. Consider a solution in which vertices u in level i have Xu = (— l)*e*. We have 



-^ + (m-l)(i+£) 



> — ^ (noting Me = 1) 

fm 



2 2 
) (setting m -) 



Let us now prove an upper bound on the GainRatio of G. Consider the optimal solution Y . Let 
the fraction of vertices u in level i with non-zero Yu be Aj. Of these, suppose (^ + fraction have 
Yy. = +1 and (^ — "^/i) have 1^ = —1. It is easy to see that we may assume ?7j's alternate in sign Thus, 
for convenience, we will let r]i denote the negated values for the alternate levels and treat all r^j's as 
positive. 

With these parameters, we see that 

rll 11 1 1 

Numerator = ^ M^Uf - ^(^ + mf "0(0" + + m) " 



i=l 

m—1 



+ E + [(i + (i + r?i+i) + (i - r?,) (i - 77^+1 



'2 
=1 



2 ^^2 ^ ^2 ^^2 
- + ^i) - Vi+i) -{\- Vi) il + m+i) 

rn m—1 

= 2(^ -M^^AfT]^ + (1 + 2e) M^'+^XiXi+,rjir]i+, 

i=l i=l 

Hence the numerator is 

m m—1 

Numerator = 2^^ -M^^A^r?^ + (1 + 2e) ^ M^'+^ \iXi+ii]i7]i+i^ (13) 

1=1 i=l 

Note that the denominator is at least Y^,- AjAf ^* (there is a contribution from every edge at least one 
end-point of which has Y nonzero). We will in fact upper bound the quantity Numerator/2 XirjiM^. 
This clearly gives an upper bound on the ratio we are interested in (as the rji are smaller than 1/2). 
Let us write 7^ = AjT/j. We are now ready to prove the theorem which implies the desired gap. A 
simple inequality useful in the proof is the following (it follows from the well-known fact that the 
largest eigenvalue of the length n path is cos(^^) 1 — ^): 

Vn > 1 and Xi e R, xj + x^ + . . . x^ > (iH 7^) (xiX2 + X2X3 + . . . Xn-iXn) (14) 

Theorem 3.4. Let 'ji > be real numbers in [0, 1], and let e,M,m be as before. Then 
- TZi llM^' + (1 + 2e) T.T=-i 7a.+iM2^+i ^ 1 



ET=1 7iM2i M^/A 
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Proof. Consider the numbers 7jM* and let r be the index where it is maximized. Denote this 
maximum value by D. 

Claim. Suppose 1 < j < m and j [r - + ^] and jjM^ > jj^- Then (15) holds. 

Suppose first that j > r + "^^^ . The numerator numerator of (15) is at most x 3m while the 
denominator is at least 'jjM^^ > x > x M'"+3v^ > x Afv^/^. (the last 

inequality is because D < M^, since < !)• This implies that the ratio is at most -^-^yj, and hence 
(15) holds. 

Next, suppose j < r — "^J . This means, since 7j < 1, that < Af Thus the numerator of 

(15) is bounded from above by D x 3m as above, while the denominator is at least jr^'^^ = -D^/7r > 
j-)2-^y/m/i ^ Thus the ratio is at most -j-j^^kJIi proving the claim. 

Thus for all indices j ^[r — \\/m, r + ^y^] (let us call this interval X), we have •jjM^ < j-^^j^. 
We thus split the numerator of (15) as 



- ^ 7^ Af^* + (1 + 2e) • ^ 7i7i+i Ar^'+^^ + remaining terms 

Note that the part in the parenthesis is < by suitable application of Eq. (14), while the remaining 
terms are each smaller than jj^i^ ■ Thus the numerator is < m ^^^^^ ■ Note that the denominator 
(of (15)) is at least D^. These two together complete the proof of the theorem. ■ 

Moving to an unweighted instance Now we will show that by choosing Nq large enough (recall 
that we chose Vi of size NqM^), we can bound how far cuts are from expectation. Let us start with 
a simple lemma. 

Lemma 3.5. Suppose A and B are two sets of vertices with m and n vertices resp. Suppose each 
edge is added independently at random w.p. (i + e). Then 

P [I # Edges ~ ( 2 + £)'mn\ > t^/mn] < e~* 
Proof. Follows from Chernoff bounds (concentration of binomial r.v.s) ■ 

Next, we look at an arbitrary partitioning of vertices with Aj and Ei values as defined previously 
(Aj of the y's nonzero, and (^ + Si) of them of some sign). 

Let us denote Ui = NqX^M'^. Between levels i and i + 1, the probability that the number of edges 
differs from expectation by (njnj+i)^/^ is at most (by the lemma above) g-C^i^i+i)^''^ . By choosing Nq 
big enough (say M^^"^) we can make this quantity smaller than g"^^™' (since each ui is at least Af^"*). 
Thus the probability that the sum of the 'errors' over the m levels is larger than Err = ^(njnj+i)^/^ 
is at most me"^^*". 

The total number of vertices in the subgraph (with Y nonzero) is at most NqM"^. Thus the 
number of cuts is 2^°*^'" < e^^™. Thus there exists a graph where none of the cuts have sum of the 
'errors' as above bigger than Err. 

Now it just remains to bound ^'^'.'^^-iijv/at ■ Here again the fact that Nq is big comes to the rescue 

9/0 

(there is only a Nq in the numerator) and hence we are done. 

4 Hardness of Approximating QP-Ratio 

Given that our algorithmic techniques give only an n^^^ approximation in general, and the natural 
relaxations do not seem to help, it is natural to ask how hard we expect the problem to be. Our 
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results in this direction are as follows: we show that the problem is APX-hard (i.e., there is no PTAS 
unless P = NP). Next, we show that there cannot be a constant factor approximation assuming 
that Max A:- AND is hard to approximate 'on average' (related assumptions are explored in [Fei02]). 
Our reduction therefore gives a (fairly) natural hard distribution for the QP- Ratio problem. 



4.1 Candidate Hard Instances 

As can be seen from the above, there is an embarrassingly large gap between our upper bounds and 
lower bounds. We attempt to justify this by describing a natural distribution on instances we do not 
know how to approximate to a factor better than (for some fixed 6 > 0). 

Let G denote a bipartite random graph with vertex sets Vl of size n and Vr of size n^/^, left 
degree for some small 5 (say 1/10) [i.e., each edge between Vl and Vr is picked i.i.d. with prob. 
y^-(9/io)j^ Next, we pick a random (planted) subset Pl of Vl of size n'^^^ and random assignments 
PL '■ Pl ^ {+1, —1} and pR : Vr ^ {+1, — !}• For an edge between i £ Pl and j G Vr, the weight 
o-ij := PL{i)PR{3)- For all other edges we assign aij = ±1 independently at random. 

The optimum value of such a planted instance is roughly n^, because the assignment of pliPr 
(and assigning to \ Pl) gives a solution of value . However, for J < 1/6, we do not know how 
to find such a planted assignment: simple counting and spectral approaches do not seem to help. 

Making progress on such instances appears to be crucial to improving the algorithm or the hard- 
ness results. We remark that the instances produced by the reduction from Random /c-AND are 
in fact similar in essence. We also note the similarity to other problems which are beyond current 
techniques, such as the Planted Clique and Planted Densest Subgraph problems [BCC'''10]. 



4.2 Reduction from Random /c-AND 

We start out by quoting the assumption we use. 

Conjecture 4.1 (Hypothesis 3 in [Fei02]). For some constant c > 0, for every k, there is a Aq, such 
that for every A > Aq, there is no polynomial time algorithm that, on most k-AND formulas with 
n-variables and m = An clauses, outputs 'typical ' , but never outputs 'typical ' on instances with 
ra/2'^^ satisfiable clauses. 

The reduction to QP-Ratio is then as follows: Given a fc-AND instance on n variables X = 
{xi, X2, . . . Xn} consisting of m clauses C = {Ci, C2, . . . Cm}, and a parameter < a < 1, let ^ = {oij} 
denote the mx n matrix such that Oij is 1/m if variable Xj appears as is in clause Cj, Oij is —1/m if 
it appears negated and otherwise. 

Let / : X — )• {— 1, 0, 1} and g : C ^ {~^-> 0) 1} denote functions which are supposed to correspond 
to assignments. Let pf = X^ieN 1/(^^)1/^ ^^'^^ = Y.jem \9{Cj)\/m. Let 

Hf,9) = — : • (16) 

apf + pg 

Observe that if we treat f{),g{) as variables, we obtain an instance of QP-Ratio [as described, 
the denominator is weighted; we need to replicate the variable set X aA times (each copy has same 
set of neighbors in C) in order to reduce to an unweighted instance - see Appendix C for details]. 
We pick a = 2~^^ and A a large enough constant so that Conjecture 4.1 and Lemmas 4.3 and 4.4 
hold. The completeness follows from the natural assignment . 

Lemma 4.2 (Completeness). // the k-AND instance is such that an a fraction of the clauses can be 
satisfied, then there exists function f , g such that 9 is at least k/2. 
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Proof. Consider an assignment that satisfies an a fraction of the constraints. Let / be such that 
/(xj) = 1 if Xj is true and —1 otherwise. Let g be the indicator of (the a fraction of the) constraints 
that are satisfied by the assignment. Since each such constraint contributes k to the sum in the 
numerator, the numerator is at least ak while the denominator 2a. ■ 

Soundness: We will show that for a typical random /c-AND instance (i.e., with high probabihty), 
the matrix A is such that the maximum value '&{f,g) can take is at most o{k). 

Let the maximum value of obtained be i^max- We first note that there exists a solution f,g of 
value '&max/'^ such that the equality afif = fig holds^ - so we only need consider such assignments. 

Now, the soundness argument is two-fold: if only a few of the vertices (X) are picked (/U/ < ^) 
then the expansion of small sets guarantees that the value is small (even if each picked edge contributes 
1). On the other hand, if many vertices (and hence clauses) are picked, then we claim that for every 
assignment to the variables (every /), only a small fraction (2~'^(^)) of the clauses contribute more 
than /c"^/^ to the numerator. 

The following lemma handles the first case when /U/ < a/400 . 

Lemma 4.3. Let k be an integer, < 6 < 1, and A be large enough. If we choose a bipartite graph 
with vertex sets X,C of sizes n,An respectively and degree k (on the C-side) uniformly at random, 
then w.h.p., for every T C X, S C C with \T\ < na/400 and \S\ < a\T\, we have \E{S,T)\ < \/^\S\. 

Proof. Let fi := \T\/\X\ (at most a/400 by choice), and m = An. Fix a subset S* of C of size 
a/im and a subset T of X of size /in. The expected number of edges between S and T in G is 
'E[E{S,T)] = kfj, ■ \S\. Thus, by Chernoff-type bounds (we use only the upper tail, and we have 
negative correlation here), 

P[E{S,T) > Vk\S\] < exp ( - ^^^^) < exp (-am/10) 

The number of such sets S',r is at most 2" x Ylt^T^'^^'^ (?) < 2"2^(°'/400)m < 2"+°"V20_ Union 
bounding and setting m/n > 20/a gives the result. ■ 

Now, we need to bound i?(/, g) for solutions such that oilJLf = Hg> aV400. We use the following 
lemma about random instances of /c-AND . 

Lemma 4.4. For large enough k and A, a random k-AND instance with An clauses on n variables 

_fc3/4 

is .such that: for any assignment, at most a 2 loo fraction of the clauses have more than k/2 + k'^'^ 
variables 'satisfied' [i.e. the variable takes the value dictated by the AND clause] w.h.p. 

Proof. Fix an assignment to the variables X. For a single random clause C, the expected number 
of variables in the clause that are satisfied by the assignment is k/2. Thus, the probability that the 
assignment satisfies more than fc/2(l -|- 5) of the clauses is at most exp(— (5^A:/20). Further, each 
fc-AND clause is chosen independently at random. Hence, by setting 5 = /c"^/* and taking a union 
bound over all the 2" assignments gives the result (we again use the fact that m ^ n/a). ■ 

Lemma 4.4 shows that for every {±1}" assignment to the variables x, at most 2~'^'^^^ fraction 
of the clauses contribute more than 2fc^/^ to the numerator of •d{f,g). We can now finish the proof 
of the soundness part above. 



■^For instance, if q/i/ > /ij,, we can always pick more constraints such that the numerator does not decrease (by 
setting g{Cj) = ±1 in a greedy way so as to not decrease the numerator) till figr — a^f, while losing a factor 2. Similarly 
for afif < fig, we pick more variables. 
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Proof of Soundness. Lemma 4.3 shows that when fXf < a/400, (7) = 0{^/k). For solutions such 
that Hf> a/400, i.e., > a'^/AOO = 2-^^/400, by Lemma 4.4 at most 2~'^(^) (< fig/k) fraction 
of the constraints contribute more than A;^/^ to the numerator. Even if the contribution is k [the 
maximum possible] for this small fraction, the value i}{f,g) < 0{k'^/^). ■ 

This shows a gap of k vs k"^/^ assuming Hypothesis 4.1. Since we can pick k to be arbitrarily 
large, we can conclude that QP- Ratio is hard to approximate to any constant factor. 



4.3 Reductions from Ratio versions of CSPs 

This section is inspired by the proof of [ABH^OS], who show that Quadratic Programming (QP) is 
hard to approximate by giving a reduction from Label Cover to QP-Ratio. 

Here we ask: is there a reduction from a ratio version of Label Cover to QP-Ratio? For this 
to be useful we must also ask: is the (appropriately defined) ratio version of Label Cover hard to 
approximate? The answer to the latter question is yes [see section A. 3 for details and proof that 
Ratio-LabelCover is hard to approximate to any constant factor]. Unfortunately, we do not know 
how to reduce from Ratio-LabelCover. However, we present a reduction starting from a ratio version 
of Unique Games. We do not know if Ratio UG is hard to approximate for the parameters we need. 
While it seems related to Unique Games with Small-set Expansion [RSIO], a key point to note is that 
we do not need 'near perfect' completeness, as in typical UG reductions. 

We hope the Fourier analytic tools we use to analyze the ratio objective could find use in other 
PCP-based reductions to ratio problems. Let us now define a ratio version of Unique Games, and a 
useful intermediate QP-Ratio problem. 

Definition 4.5 (Ratio UG ). Consider a unique label cover instance ZY(G(y,£;), [i?],{7re I e G E}). 
The value of a partial labeling L : y — )■ [i?] U {_L} (where label _L represents it is unassigned) is 

\{(n,v)(,E\i,„,„{Uu))^L{v)]\ 
""'f^* = |{»<EF|LW#X)I 

The (s,c)-Ratio UG problem is defined as follows: given c > s > (to be thought of as constants), 
and an instance U on a regular graph G, distinguish between the two cases: 

- YES: There is a partial labeling L : V ^ [R\U {-L}, such that val{L) > c. 

- NO: For every partial labeling L :¥ ^ [R]U {_L}, val{L) < s. 

The main result of this section is a reduction from (s, c)- Ratio UG to QP-ratio. We first introduce 
the following intermediate problem: 

Definition 4.6. QP-lntermediate: Given A(^nxn) with An < , maximize j^ fx l ^'^^ ^* ^ 

Note that A is allowed to have diagonal entries (albeit only negative ones) , and that the variables 
are allowed to take values in the interval [—1,1] . 

Lemma 4.7. Let A define an instance of QP-lntermediate with optimum value opt^. There exists an 
instance B of QP-Ratio on (n • m) variables, with m < maxj^^^^^^, 2n} -|- 1, and the property that its 
optimum value opt2 satisfies opt^ — e < opt2 < opt]^ -\-£. [Here \\A\\i = ^ {o^ijl-] 

Proof. The idea is to view each variable as an average of a large number (in this case, m) of new 
variables: thus a fractional value for Xi is 'simulated' by setting some of the new variables to ±1 and 
the others zero. See Appendix A. 2 for details. ■ 
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Thus from the point of view of approximabihty, it suffices to consider QP-lntermediate. We now 
give a reduction from Ratio UG to QP-lntermediate. 



Input: An instance T = (!/,£', IT) of Ratio UG , with alphabet [R]. 
Output: A QP-lntermediate instance Q with number of variables N = 
Parameters: rj := W^7i'^2^^ 


--\V\-2^. 


Construction: 




— For every vertex u € V, we have 2^ variables, indexed by x G {- 
by fu{x), and view fu as a function on the hypercube {—1, 1}^. 


1,1}^. We will denote these 


- Fourier coefficients (denoted fuiS) = 'Ex[xs{^)fu{3:)]) are linear forms in the variables fu{x)- 


- For {u,v) G E, define T^v = Y.ifu{{'i})fv{{'^uv{i)})- 




- For u G V, define L{u) = fu{Sf- 




— The instance of QP-lntermediate we consider is 




„ ^(u,v)£E Tuv — E„ L{u) 
Q := max 

^u\Ju\l 




where denotes Ex[\fuix)\]. 





Lemma 4.8. (Completeness) If the value of T is > a, then the reduction gives an instance of 
QP-lntermediate with optimum value > a. 

Proof. Consider an assignment to T of value a and for each u set fu to be the corresponding dictator 
(or = if u is assigned _L). This gives a ratio at least a (the L{u) terms contribute zero for each 
u). U 

Lemma 4.9. (Soundness) Suppose the QP-lntermediate instance obtained from a reduction (starting 
with T) has value r, then there exists a solution to T of value > t'^/C, for an absolute constant C. 

Proof. Consider an optimal solution to the instance Q of QP-lntermediate, and suppose it has a value 
T > 0. Since the UG instance is regular, we have 

Y.U E^er{n) Tuv - vHu T{u) ,^ 
val{Q) = ^ II . II • (17) 

First, we move to a solution such that the value is at least r/2, and for every u, is either 
zero, or is "not too small". The choice of r/ will then enable us to conclude that each fu is 'almost 
linear' (there are no higher level Fourier coefficients). 

Lemma 4.10. There exists a solution to Q of value at least t/2 with the property that for every u, 
either fu = Oor ||/„||i > 

Proof. Let us start with the optimum solution to the instance. First, note that > 1/2^, 

because if not, \ fu{x)\ < 1 for every u and x G {—1, 1}^. Thus if we scale all the f^s by a factor 
z > 1, the numerator increases by a z'^ factor, while the denominator only by z; this contradicts the 
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optimality of the initial solution. Since the ratio is at least r, we have that the numerator of (17) 
(denoted num) is at least t/2^. 

Now since \fuiS)\ < \\fu\\i for any S, we have that for all u,v, T^v < R ■ ||/m||i||/d||i- Thus 
^visT{u)T^uv ^ R • Thus the contribution of u s.t. ||/m||i < r/(n2^'^) to num is at most 

n X R ■ ^^.2R < 2^1+1 < num/2. Now setting all such /« = will only decrease the denominator, and 
thus the ratio remains at least t/2. [We have ignored the L[u) term because it is negative and only 
improves when we set = 0.] ■ 

For a boolean function /, we define the 'linear' and the 'non-linear' parts to be 

■■= E and :=/-/=!= J] f{S)x{S). 

i \S\^l 

Our choice of r/ will be such that: 

1. For aU u with /„ / 0, H/^^Hl < ||/«|li/10^. Using Lemma 4.10 (and the naive bound r > 1/n), 
this will hold if > 10^n''2^^. [A simple fact used here is that X^uE[Tu.y] < nR.] 

2. For each u, H/if^Hl < This will hold if r/ > n2^^ and will allow us to use Lemma A.l. 

Also, since by Cauchy-Schwarz inequality, \fu\% ^ ^u-i can conclude that 'most' of the Fourier 
weight of fu is on the linear part for every u. We now show that the Cauchy-Schwarz inequality 
above must be tight up to a constant (again, for every u). The following is the key lemma in the 
reduction: it says that if a boolean function / is 'nearly linear', then it must also be spread out 
[which is formalized by saying ||/||2 ~ This helps us deal with the main issue in a reduction 

with a ratio objective - showing we cannot have a large numerator along with a very small value of 
II /111 (the denominator). Morally, this is similar to a statement that a boolean function with a small 
support cannot have all its Fourier mass on the linear Fourier coefficients. 

Lemma 4.11. Let f : {— 1,1}'^ i— ?• [—1,1] satisfy ||/||i = S. Let f^^ and f^^ be defined as above. 
Then if \\f\\l > (10^ + 1)^2, we have ||/^i||2 > . 

Proof Suppose that [[/III > (10^ + 1)6'^, and for the sake of contradiction, that [[/^"^IH < S'^- Thus 
since ll/lli = lirii + ||/^i||i, we_have \\f=Y > {W06f. 

If we write = /({i}), then f^^{x) = OiXj, for every x G { — 1, 1}^- From the above, we have 
^^af > (1005)2. Now if |aj| > 45 for some i, we have ||/"^||i > (1/2) • 45, because the value of 
at one of x,x (B ei is at least 45 for every x. Thus in this case we have ||/^^||i > 25. 

Now suppose \ai\ < 45 for all i, and so we can use Lemma A. 2 to conclude that Px{f^^{x) > 
1005/10) > 1/4, which in turn implies that |/=i|i > (1005/10) • Px{f=\x) > 1005/10) > 25. 

Thus in either case we have ||/^^||i > 25. This gives ||/ — /^""^[[i > H/^""^!!! — ||/||i > 5, and hence 
11/ — /^"^lll > 52 (Cauchy-Schwarz), which implies H/^^lH > 5^, which is what we wanted. ■ 

Now, let us denote 5„ = |/m|i. Since T is a unique game, we have for every edge {u,v) (by 
Cauchy-Schwarz) , 



Now we can use Lemma 4.11 to conclude that in fact, Tu^v < 10^5m5„. Now consider the following 
process: while there exists a u such that 5^ > and E^gp(„) 5^ < 47^0^, set /„ = 0. We claim that 
this process only increases the objective value. Suppose u is such a vertex. From the bound on Tuv 
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above and the assumption on u, we have E^^y'{u) ^uv < • t/A. If we set fu = 0, we remove at most 
twice this quantity from the numerator, because the UG instance is regular [again, the L{u) term 
only acts in our favor]. Since the denominator reduces by 5u, the ratio only improves (it is > r/2 to 
start with). 

Thus the process above should terminate, and we must have a non-empty graph at the end. Let 
S be the set of vertices remaining. Now since the UG instance is regular, we have that 5u = 
X^„E^gr(ii) ^v- The latter sum, by the above is at least \S\ ■ r/(4 • 10^). Thus since the ratio is at 

2 

least r/2, the numerator num > \S\ ■ g:^- 

Now let us consider the following natural randomized rounding: for vertex u £ S, assign label i 
with probability Now observing that |/m({^})| < 2 for all u (Lemma A.l), 

we can obtain a solution to ratio-UG of value at least num/|S'|, which by the above is at least r^/C 
for a constant C. 

This completes the proof of Lemma 4.9. ■ 
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A Hardness of QP-Ratio 
A.l Boolean analysis 

Lemma A.l. [ABH+05] Let /„ : {-1, 1}^ [-1, 1] be a solution to Q of value r > 0. Then 

R 

yuGV 5]|7n(W)|<2. 

i=l 

Proof. Assume for sake of contradiction that fu{{i}) > 2. 

Since f^^ is a hnear function with co-efRcients {fu{{i})}, there exists some y G {—1, 1}^ such that 
fuHy) = Ei \k{i})\ > 2. For this y, we have f^\y) = f{y) - f=\y) < -1. 

Hence |/^^|^ > 2~^, which gives a negative value for the objective, for our choice of r]. ■ 

The fohowing is the weU-known Berry-Esseen theorem (which gives a quantitative version of 
central limit theorem). The version below is from [O'D]. 

Lemma A. 2. Let ai, . . . ,aR be real numbers satisfying a? = 1, and af < t for all i £ [R]. Let 
Xi be i.i.d. Bernoulli filj random variables. Then for all 9 > 0, we have 

\p[Y,a^x^>e]-N{e)\ <r, 

i 

where N{6) denotes the probability that g > 6, for g drawn from the univariate Gaussian M{0, 1). 
A. 2 Reducing QP-lntermediate to QP-Ratio 

In this section we will prove Lemma 4.7. Let us start with a simple observation 

Lemma A. 3. Let A be an n x n matrix (it could have arbitrary diagonal entries). Suppose {xi}, 
1 < i < n is the optimum solution to 

x'^Ax 
max — -. 

a;»e[-i,i] \xi\ 

Now let 6 < min{ (where \\A\\i = Then perturbing each Xi additively by 5 

(arbitrarily) changes the value of the ratio by an additive factor of at most e. 

Proof. First note that \xi\ > 1, because otherwise we can scale all the Xi by a factor z > 1 and 
obtain a feasible solution with a strictly better value [because the numerator scales by and the 
denominator only by z]. Thus changing each Xj by 5 < l/2?7- will keep the denominator between 1/2 
and 3/2. Now consider the numerator: it is easy to see that a term OijXiXj changes by at most (5|aij|, 
and thus the numerator changes by at most (5||j4||i. Thus the ratio changes by an additive factor at 
most 25||A||i < e, by the choice of 6. ■ 

Proof of Lemma ^.7. Start with an instance of QP-lntermediate given by ^(„xn)i suppose the 
optimum value is opt^. Let m be an integer which will be chosen later [think of it as sufficiently large]. 
Consider the quadratic form B on n-m variables, defined by writing Xi = ^ - (y|^^ + + • • • + yj'"'* ) , 
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and expanding out Ax. Let C be a form on (the same) n-m variables obtained from B by omitting 
the square (diagonal) terms. Now consider the QP-Ratio instance given by C . That is, 



maximize — ^-p-, subject to G {-1, 0, 1}. (19) 

Let us write an = —ai (by assumption aj > 0). The we observe that 

0-)i 



y^Cy y^By , ^ • ( 1^/^^!) 



+ (20) 



By the assumption on Qj, we have 



y^By ^ y^Cy ^ ^^^^ I " 



We prove the two inequalities separately. First let us start with an optimum solution {xi} to QP- 
Intermediate (from A) with value opt^. As above, define 5 = minjgp^, 2^}. Let us round the values 
Xi to the nearest integer multiple of 5 [for simplicity we will assume also that 1/5 is an integer]. By 
Lemma A. 3, this will change the objective value by at most e. We will choose m to be an integer 
multiple of 1/(5, thus if we set = sign{xi) for j = 1,2, .. . ,7nxi and for the rest, we obtain a 
value at least opt^ — e for the QP-Ratio problem defined by C [using the first half of (21)]. Thus 
opt2 > opt]^ — e. 

Now consider a solution to the QP-Ratio problem defined by C, and set Xi = ^ ■ {y^^^ + ■ ■ ■ yi"^^)- 
For this assignment, we have 

x'^Ax y^ By y^ By y'^Cy 



because we will choose m > Mli. This implies that opt]^ > opt2 



m 

Thus we need to choose m to be the smallest integer larger than maxj^^^^J^, 2n} for all the bounds 
to hold. This gives the claimed bound on the size of the instance. ■ 



A. 3 Towards NP-hardness - LabelCover with SSE 

The PCP theorem, [AS98, ALM"'"98] combined with the parallel repetition theorem [Raz98] yields 
the following theorem. 

Theorem A. 4 (Label Cover hardness). There exists a constant 7 > so that any 2i-SAT instance w 
and any R > 0, one can construct a Label Cover instance C, with vertices, and label set of 

size R, so that: if w is satisfiable, val(>C) = 1 and otherwise val(£) = r < R'"' . Further, C can be 
constructed in time polynomial in its size. 

Definition A. 5 ((/i, r/)-Expanding Label Cover ). An instance of the Label Cover problem is said to 
be (/i, ?7)-expanding if for every fj,' < fi, and functions / : ^ — )■ [0, 1] and g : B ^ [0,1] such that 
EaeA[f{a)] = EbMb)]=f^', 

E(a,b)eE[f{a) g{b)] < fi'v- 

Theorem A. 6. For every 6 > and rj > d'^^^ , one can convert a Label Cover instance C into an 
instance {S,r]) Expanding Label Cover C (with the same completeness vs soundness). Further, the 
size of C is at most \C\^^^ and has a label set of size at most R/6. 
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Proof. We use one of product instances used in [HK04] (which gives Mixing but not Smoothness - 
Appendix A. 2). They can argue about the expansion of only sets that are sufficiently large (constant 
fraction), while we need to work with all set sizes < 6n. 

Given an instance of Label Cover represented as T = {AuB, E, U, [R]), let = {A^UB, E'',U, [R]), 
where (1/??)^ < k < 1/6. Let g : B ^ [0,1] be any function defined on the right hand side. For each 
a £ A, let Xa denote the average value of g over the neighborhood of a. Since T is right-regular, we 
have EaeAiXa] = EbGB9{b)- 

For a vertex a = (oi, 02, • • • fflfc) € A!^ , define Ya to be the average value of g over the neighborhood 
of a (counting multiple edges multiple times). 

E [Y^] = E 

1 9A 

<T^ + 5'^ <^ (for 5 < i/fc) 

k k 

Thus, by a second moment bound, the fraction of vertices in A'' with Ya greater than t] is at most 
•iS/kn^. Thus, for any f : A^ [0, 1], such that E[/] = 6, 

< 36ri since (1/^?)"^ < k 




Ratio Label Cover. Consider the ratio version of Label Cover - the goal is to find a partial 
assignment to a Label Cover instance, which maximizes the fraction of edges satisfied, divided by the 
fraction of vertices which have been assigned labels (an edge (u, v) is satisfied iff both the end points 
are assigned labels which satisfy the constaint tTu^^). 

It follows by a fairly simple argument that Theorem A. 6 shows that Ratio Label Cover is NP-hard 
to approximate within any constant factor. We sketch an argument that shows NP-hardness of 1 vs 
7 for any constant 7 > 0. We start with Label Cover instance T with completeness 1 and soundess 
T < 7^/^ (and appropriate label size). Our Ratio Label Cover instance is essentially obtained by 
applying Theorem A. 6 : the instance has soundness r and is (7^^^, 7) expanding. The completeness 
of this instance is easily seen to be 1. 

To argue the soundess, it first suffices to only consider solutions f,g which have equal measure 
(similar to the argument in section 4.2 - the instance first has its right side duplicated so that right 
and left sizes are equal, and then upto a factor 2 loss, we only pick equal number of vertices on 
both sides). The expansion of sets of measure < 6 have value at most 7 due to the expansion of 
the instance. Suppose there is a solution of measure > 6 which has ratio label cover value 7, then 
we obtain a solution to the Label Cover instance T of value at least ■ 5 = 7^/^ > r, which is a 
contradiction. 



B APX- hardness of QP- Ratio 

We proved that QP-Ratio is hard to approximate to an 0(1) factor assuming the small-set expansion 
conjecture. Here we prove a weaker hardness result - that there is no PTAS - assuming just P 7^ NP. 
We do not go into the full details, but the idea is the following. 
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We reduce Max-Cut to an instance of QP- Ratio. The following is well-known (we can also start 
with other QP problems instead of Max-Cut) 

There exist constants 1/2 < p' < p such that: given a graph G = {V, E) which is regular 
with degree d, it is NP-hard to distinguish between 
Yes. MaxCut(G) > p ■ "'^/2, and 
No. MaxCut(G) < p' ■ nd/2. 

Given an instance G = iV^E) of Max-Cut, we construct an instance of QP-Ratio which has V 
along with some other vertices, and such that in an OPT solution to this QP-Ratio instance, all 
vertices of V would be picked (and thus we can argue about how the best solution looks). 

First, let us consider a simple instance: let abode be a 5-cycle, with a cost of for edges 
ab,bc,cd,de and —1 for the edge ae. Now consider a QP-Ratio instance defined on this graph (with 
±1 weights). It is easy to check that the best ratio is obtained when precisely four of the vertices are 
given non-zero values, and then we can get a numerator cost of 3, thus the optimal ratio is 3/4. 

Now consider n cycles, UibiCidiCi, with weights as before, but scaled up by d. Let A denote the 
vertex set {oj} (similarly B,C, ..). Place a clique on the set of vertices A, with each edge having a 
cost Wd/n. Similarly, place a clique of the same weight on E. Now let us place a copy of the graph 
G on the set of vertices C. 

It turns out (it is actually easy to work out) that there is an optimal solution with the following 
structure: (a) all Oj are set to 1, (b) all Cj are set to —1 (this gives good values for the cliques, and 
good value for the aj6j edge), (c) q are set to ±1 depending on the structure of G, (d) If Cj were set 
to bi = +1, and dj = 0; else 5^ = and di = —1 (Note that this is precisely where the 5-cycle 
with one negative sign helps!) 

Let xi, ...,Xn £ { — 1} be the optimal assignment to the Max-Cut problem. Then as above, we 
would set Ci = Xi. Let the cost of the MaxCut solution be 6 ■ "■d-/2. Then we set 4n of the 5n variables 
to ±1, and the numerator is (up to lower order terms): 

2 • (10d/n)«72 + 9 ■ ndj2 ^ 3nd = (A + e)nd, 

where A is an absolute constant. 

We skip the proof that there is an optimal solution with the above structure. Thus we have that 
it is hard to distinguish between a case with ratio (A -|- p')d/A, and (A -|- p)d/4:, which gives a small 
constant factor hardness. 

C Reduction from a weighted to an Unweighted version 

Lemma C.l. Let A be an n x m matrix, and let w > 1 be an integer. Let opti denote the optimum 
value of the problem 

x^ Ay 

max — - — — —. 

a:6{-l,0,l}",ye{-l,0,l}'" -|- ||y||i 

Let B be a wn x m matrix formed by placing w copies of A one below the other. [In terms of bipartite 
graphs, this just amounts to making w copies of the left set of vertices.] Let opt2 denote the optimum 
value of the problem 

z^By 

max -TT—r — —. 

2e{-i,o,i}'^",i/e{-i,o,i}'" \\z\\i + \\y\\i 

Then opti = opt2. 
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Proof. It is clear that opt2 > opti: simply take a solution of value opti for the first problem and 
form z by taking w copies of x. To see the other direction, let us view z as being formed of 'chunks' 
zi, . . . ,z^ of size n each. Consider a solution to the second problem of value opt2- Then 

zfAy + zjAy + ... z^Ay 

op*2 = — n — n n — n r~i~> 

ll^illi + • • • \\zw\\i + \y\ 

which implies that if we set x = Zi which gives the largest value of 2:?'Ay/||zj ||i, we obtain a value at 
least opt2 to the first problem. 

This completes the proof. ■ 
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